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TITLE OF THE INVENTION 

METHODS FOR OBTAINING THERMOSTABLE ENZYMES, DNA POLYMERASE I 
VARIANTS FROM THERMUS AQUATICUS HAVING CATALYTIC ACTIVITIES, 

METHODS FOR OBTAINING THE SAME, AND APPLICATIONS OF THE SAME 

CROSS-REFERENCE TO RELATED APPLICATIONS 
Not Applicable. 

BACKGROUND OF THE INVENTION 

Field of the Invention 

The present invention provides a method for obtaining thermostable enzymes. The 
present invention also provides variants of DNA polymerase I from Thermus aquaticus. The 
present invention further provides methods of identifying mutant DNA polymerases having 
enhanced catalytic activity. The present invention also provides polynucleotides, expression 
systems, and host cells encoding the mutant DNA polymerases. Still further, the present 
invention provides a method to carry out reverse transcriptase-polymerase chain reaction 
(RT-PCR) and kits to facilitate the same. 

Discussion of the Background 

Filamentous phage display is commonly used as a method to establish a link between 
a protein expressed as a fusion with a phage coat protein and its corresponding gene located 
within the phage particle (Marks et al., J. Biol Chem. (1992) 267, 16007-16010). The use of 
filamentous phage particles as a chemical reagent provides further a strategy to create a 
complex between an enzyme, its gene and a substrate (Jestin et al., Angew. Chem, InU Ed. 
(1999) 38, 1 124-1 127). This substrate can be cross-linked on the surface of filamentous 
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phage using the nucleophiHc properties of coat proteins. If the enzyme is active, conversion 
of the substrate to the product yields a phage particle cross-linked with the product, which 
can be captured by affinity chromatography (see discussion in Vichier-Guerre & Jestin, 
Biocat. &Biotransf. (2003) 21, 75-78). 

Several similar approaches based on product formation for the isolation of genes 
encoding enzymes using phage display have been described in the literature for various 
enzymes (Fastrez et al., (2002) In: Brackmann, S. and Johnsson, K. eds., Directed Molecular 
Evolution of Proteins (Wiley VCH, Weinheim), pp 79-1 10). These in vitro selections of 
proteins for catalytic activity are well suited for use with large repertoires of about 10^ 
proteins or more. Several libraries of enzyme variants on phage have been constructed and 
catalytically active proteins with wild type like activities have been isolated (Atwell & Wells 
(1999) Proc. Natl Acad. Sci, USA 96, 9497-9502; Heinis et al. (2001) Prot. Eng. 14, 1043- 
1052; Ponsard et al. (2001) Chembiochem, 2, 253-259; Ting et al. (2001) BiopoL 60, 220- 
228.). Mutants with different substrate specificities have been also obtained (Xia et al. 
(2002) Proa Natl Acad. Sci. USA 99, 6597-6602.). In these studies, the fraction of active 
variants in the libraries can be large and it remains unclear how rare an enzyme can be in the 
initial protein library so as to be selected after iterative selection cycles. Accordingly, there 
remains a critical need for an efficient process for making and identifying thermostable 
enzymes possessing a desired catalytic activity. 

Reverse transcriptases are enzymes that are present generally in certain animal viruses 
(i.e., retroviruses), which are used in vitro to make complementary DNA (cDNA) from an 
mRNA template. Practically, reverse transcriptases have engendered significant interest for 
their use in reverse transcriptase-polymerase chain reaction (RT-PCR). As such, these 
proteins lend themselves to be a model system for development of an efficient method of 
making thermostable enzymes having a desired activity. 
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RNA generally contains secondary structures and complex tertiary sections, 
accordingly it is highly desired that the RNA be copied in its entirety by reverse transcription 
to ensure that integrity of cDNA is maintained with high accuracy. However, due to the often 
complicated secondary and tertiary structures of RNA, the denaturation temperatures are 
generally about 90**C and, as such, the reverse transcriptase must be capable of withstanding 
these extreme conditions while maintaining catalytic efficiency. 

The classically utilized enzymes for RT-PCR have been isolated from the AMV 
(Avian myeloblastosis virus) or MMLV (Moloney murine leukemia virus); however, these 
enzymes suffer fi-om a critical limitation in that they are not thermostable. In fact, the 
maximum temperature tolerated by most commercially available reverse transcriptases is 
about 70°C. 

One conmion approach to overcome this limitation in the existing technology with the 
previously described polymerases has been the use of a protein chaperones in addition to the 
polymerase. However, this method leads to problems associated with environmental 
compatibility metal ion requirements, multi-stage procedures, and overall inconvenience. 
Accordingly, an ahemative strategy has been to use thermostable reverse transcriptases. This 
approach makes it possible to perform multiple denaturation and reverse transcription cycles 
using only a single enzyme. 

To this end, the DNA-dependent DNA polymerase I of Thermus aquaticus (i.e., Taq 
polymerase), is thermostable and has reverse transcriptase activity only in the presence of 
manganese. However, when the manganese ion concentration is maintained in the millimolar 
range the fidelity of the enzyme is affected. It has been suggested that the thermostable 
DNA-dependent DNA polymerase of Bacillus stearothermophilus has reverse transcriptase 
activity, even in absence of magnesium, but in this case it is necessary to add a thermostable 
DNA polymerase for the PGR. 
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Therefore, there remains a critical need for high efficiency, thermostable enzymes that 
are capable of catalyzing reverse transcription and subsequent DNA polymerization in "one- 
pot" RT-PCR. Accordingly, the present invention provides an isolated population of 
thermostable reverse transcriptases, which are active in absence of manganese, by directed 
evolution of the Stoffel fi-agment of the Taq polymerase. 



SUMMARY OF THE INVENTION 
It is an object of the present invention to provide a method of identifying thermostable 
mutant polypeptides having a catalytic activity by: 

a) packaging a vector in which a gene or fragment thereof encoding variants of a 
catalj^ic domain responsible for the catalytic activity fiised to a gene encoding a phage coat 
protein, 

b) isolation and purification of phage particles; 

c) heating the phage-mutant polypeptide at a temperature ranging from 50°C to 90°C 
for a time ranging firom less than 1 minute to several hours 

d) cross-linking a specific substrate with a phage particle 

e) forming a reaction product fi-om the substrate catalyzed by the thermostable mutant 
protein on phage, wherein the temperature is optionally regulated to be the same or greater or 
lower than the temperature of (c) 

f) selecting the phage particles comprising a variant nucleotidic sequence encoding 
for the catalytic domain responsible for the catalytic activity at the regulated temperature, by 
capturing the reaction product or screening for said reaction product, 

g) infecting J?, coli with the phage particles selected at step (f), 

h) incubating the infected E, coli; and 
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i) assessing catalytic activity of the proteins corresponding to isolated genes. 
It is an object of the present invention to provide a thermostable mutant DNA 
polymerase having at least 80% homology to the Stoflfel fragment (SEQ ID NO: 26) of DNA 
polymerase I obtained from Thermus aquaticus. 

To this end, the present invention provides thermostable polypeptides having at least 
80% homology to SEQ ID NO: 26, wherein said polypeptide has at least one mutation 
selected from the group consisting of a mutation in amino acids 738 to 767 of SEQ ID 
NO:26, A331T, S335N, M470K (position 747 of the Taq polymerase wild-type sequence), 
M470R (position 747 of the Taq polymerase wild-type sequence), F472Y (position 749 of the 
Taq polymerase wild-type sequence), M484V (position 761 of the Taq polymerase wild-type 
sequence), M484T (position 761 of the Taq polymerase wild-type sequence), and W550R 
(position 827 of the Taq polymerase wild-type sequence), and wherein said polypeptide has 
improved DNA polymerase activity and retains 5 '-3' exonuclease activity. In an object of 
the present invention, the 3 '-5' exonuclease activity of the mutant polypeptide is inactive. 

In an object of the present invention, the thermostable mutant DNA polymerase also 
has a mutation at one or more position selected from A331, L332, D333, Y334, and S335 of 
SEQ ID NO: 26 (positions 608-612 of the Taq polymerase wild-type sequence). 

In a particular object of the present invention, the mutant DNA polymerase has one of 
the following sequences: SEQ ID NO: 20, SEQ ID NO: 22, SEQ ID NO: 24, SEQ ID NO: 28, 
SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, SEQ ID NO: 36, and SEQ ID NO: 38. 

Further, in another object of the present invention are polynucleotides that encode for 
the aforementioned thermostable mutant DNA polymerases. 

In yet another object of the present invention is a kit for DNA amplification, which 
contains: (a) one or more of the aforementioned thermostable mutant DNA polymerases; (b) 
a concentrated buffer solution, wherein when said concentrated buffer is admixed with the 
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isolated polypeptide the overall buffer concentration is IX; (c) one or more divalent metal ion 
(e.g., Mg^"" or Mn^"^); and (d) deoxyribonucleotides. 

In yet another object of the present invention is a method of reverse transcribing an 
RNA by utilizing the inventive thermostable mutant DNA polymerases. 

In still a further object of the present invention is a phage-display method for 
identifying thermostable mutant DNA polymerases in which the Stoffel fragment has been 
mutated, while the DNA polymerase activity and 5 '-3' exonuclease activity has been 
maintained and/or enhanced. 

The above objects highlight certain aspects of the invention. Additional objects, 
aspects and embodiments of the invention are found in the following detailed description of 
the invention. 



BRIEF DESCRIPTION OF THE FIGURES 

A more complete appreciation of the invention and many of the attendant advantages 
thereof will be readily obtained as the same becomes better understood by reference to the 
following Figures in conjunction with the detailed description below. 

Figure 1 shows the reverse transcriptase activity of phage-polymerases assessed as 
obtained after different rounds of selection in the presence of Mg^"^ or Mn^^ ions. The lane 
labels correspond to the following: 
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MaCh 

a : phage-polymerases of round 6 

b : phage-polymerases of round 5 

c : phage-polymerases of round 4 

d : phage-polymerases of round 3 

e : phage-polymerases of round 2 

f : phage-polymerases of round 1 

g : phage-polymerases of initial population 



MgCl9 

h: phage-polymerases of round 6 

i : phage-polymerases of round 5 

j : phage-polymerases of roxmd 4 

k : phage-polymerases of round 3 

1 : phage-polymerases of round 2 

m : phage-polymerases of round 1 

n : phage-polymerases of initial population 



Figure 2 shows the reverse transcriptase activity of phage-polymerases assessed as 
obtained after different rounds of selection in the presence of Mg^"" ions. The lane 
designations in Figure 2 are as follows: 



Phage-polymerase heated at 65 °C for 5 min. 

a : phage-polymerases of initial population 

b : phage-polymerases of round 1 

c : phage-polymerases of round 2 

d : phage-polymerases of round 3 

e : phage-polymerases of round 4 

f : phage-polymerases of round 5 

g : phage-polymerases of round 6 



Phage-polymerase not preheated 



h : phage-polymerases of initial population 

i : phage-polymerases of round 1 

j : phage-polymerases of round 2 

k : phage-polymerases of round 3 

1 : phage-polymerases of round 4 

m : phage-polymerases of round 5 

n : phage-polymerases of round 6 

o : control AMV-RT, 1 U 

p : control AMV-RT, 0.1 U 

q : control AMV-RT, 0.01 U 

r : control AMV-RT, 0.001 U 



Figure 3 shows the reverse transcriptase activity of various monoclonal phage- 
polymerases obtained after round 6 in the presence of Mg^"" ions. The lane designations in 
Figure 3 are as follows: s = SEQ ID NO: 38; a = SEQ ID NO: 20; d = SEQ ID NO: 24; g = 
SEQ ID NO: 28; C = AMV-RT; i = SEQ ID NO: 30; m = SEQ ID NO: 32; n = SEQ ID NO: 
34; b = SEQ ID NO: 22; and q = SEQ ID NO: 36. 



Figure 4 shows the reverse transcriptase activities and the polymerase activities of 
monoclonal phage-polymerases obtained after the round 6 in the presence of Mg^"" or Mn^"" 
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ions. The lane designations in Figure 4 are as follows: a = SEQ ID NO: 20; b = SEQ ID NO: 
22; d = SEQ ID NO: 24; and e = SEQ ID NO: 26. 

Figure 5 shows purified mutant RT-polymerases a, b, and d used in polymerase chain 
reaction. The lanes in the gel appearing in Figure 5 include the three clones corresponding on 
clones a, b and d on Figure 4. In addition, the positive control was perfonned using the 
Stofifel fragment polymerase e and commercially Taq polymerase (Promega). The lanes in 
Figure 5 are as follows: 

lane 1 : Taq 

lane 2 : a = SEQ ID NO: 20 
lane 3 : b = SEQ ID NO: 22 
lane 4 : d = SEQ ID NO: 24 
lane 5: e = SEQ ID NO: 26 
lane 6 : Molecular weight marker 

Figure 6 shows purified mutant RT-polymerases a, b, and d used in RT-polymerase 

chain reaction. The lanes in the gel appearing in Figure 6 include the three clones 

corresponding to clones a, b and d on Figure 4. In addition, the positive control was 

performed using the Stoffel fragment polymerase e and the phage-polymerase of AMV-RT 

(Promega). The lanes in Figure 6 are as follows: 

lane 1 : : molecular weight marker 
lane 2 : control AMV-RT 
lane 3 : b = SEQ ID NO: 22 
lane 4 : a = SEQ ID NO: 20 
lane 5: e = SEQ ID NO: 26 
lane 6 : d = SEQ ID NO: 24 
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DETAILED DESCRIPTION OF THE INVENTION 
Unless specifically defined, all technical and scientific terms used herein have the 
same meaning as commonly understood by a skilled artisan in enzymology, biochemistry, 
cellular biology, molecular biology, and the medical sciences. 

All methods and materials similar or equivalent to those described herein can be used 
in the practice or testing of the present invention, with suitable methods and materials being 
described herein. All publications, patent applications, patents, and other references 
mentioned herein are incorporated by reference in their entirety. In case of conflict, the 
present specification, including definitions, will control. Further, the materials, methods, and 
examples are illustrative only and are not intended to be limiting, unless otherwise specified. 

The present invention provides a method of identifying thermostable mutant 
polypeptides having a catalytic activity comprising: 

a) packaging a vector in which a gene or fi-agment thereof encoding variants of a 
catalytic domain responsible for the catalytic activity fused to a gene encoding a phage coat 
protein, 

b) isolation and purification of phage particles; 

c) heating the phage-mutant polypeptide at a temperature ranging fi-om 50°C to 90°C, 
preferably fi-om 55«C to 65^C, more preferably at 65°C for a time ranging from 30 seconds to 
several hours, preferably firom 1 minute to 3 hours, more preferably fi-om 5 minutes to 2 
hours, most preferably 10 minutes to 1 hour 

d) cross-linking a specific substrate with a phage particle 

e) forming a reaction product fi-om the substrate catalyzed by the thermostable mutant 
polypeptide on phage, wherein the temperature is optionally regulated to be the same or 
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greater or lower than the temperature of (c) (i.e., from 25°C to TCC, preferably from 37°C to 
70°C and more preferably at 65°C). 

f) selecting the phage particles comprising a variant nucleotidic sequence encoding 
for the catalytic domain responsible for the catalytic activity at the regulated temperature, by 
capturing the reaction product or screening for said reaction product, 

g) infecting E. coli with phage particles selected at (f) 

h) incubating the infected E. coli; and 

i) assessing catalytic activity of the proteins corresponding to isolated genes. 
In the embodiment above, the gene or fragment thereof encoding variants of a 

catalytic domain may be directly or indirectly ftised to the gene encoding a phage coat 

protein. When the gene or fragment thereof encoding variants of a catalytic domain and the 

gene encoding a phage coat protein are indirectly fused it is preferred that the ftision be 

through a peptide or polypeptide linker. 

Within this above-recited embodiment, steps (a) to (h) may be repeated 0 to 20 times, 

preferably 1 to 15 times, more preferably 2 to 10 times, most preferably 3 to 7 times 

The method comprising a single cycle (repeated 0 times) is particularly adapted to 

high throughput screening, when steps are repeated from 3 to 7 times, the method is better 

adapted for classical empirical screening. 

The peptide utilized within this embodiment is selected from the group consisting of: 
a flexible linker such as a glycine rich linker such as (SG4)n (SEQ ID NO: 39), 
Human cahnodulin (SEQ ID NO: 46, the DNA encoding SEQ ID NO:46 is SEQ ID 
NO:56), and 

Hexahistidine binding single chain variable fragment {Grutter M.G., J. Mol. Biol. 2002, 318. 
J 35-147.) consisting of 

(i) Anti-His Tag Antibody 3D5 Variable Heavy Chain (SEQ ID NO: 47) 
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(ii) Linker (SEQ ID NO: 48) 

(iii) Anti-His Tag Antibody 3D5 Variable Light Chain (SEQ ID NO: 49). 

Moreover, the polypeptide linker is selected from the group consisting of: any protein 
binding the substrate at high temperature, any catalytic domain such as exonuclease 5' to 3' 
(from Thermus thermophilus, SEQ ID NO: 50), or 3' to 5 '(from E. coli, SEQ ID NO: 51), 
Catalytic domain of Bacillus circulans cyclodextringlycosyltransferase (SEQ ID NO: 52, the 
DNA is in SEQ ID NO:57), Catalytic domain of Bordetella pertussis adenylate cyclase(SEQ 
ID NO: 53-the DNA is in SEQ ID NO:58), Bacillus amyloliquefaciens serine protease 
subtilisin (SEQ ID NO: 54— the DNA is in SEQ ID NO:59), and Catalytic domain of 
Bacillus subtilis lipase A (SEQ ID NO: 55, Quax W.J. 2003, 101. 19-28 J. Biotechnol). 

As used in the present invention, the cross-linking between the specific substrate of 
the catalytic domain of the polypeptide with the phage particle is made by a cross-linking 
agent selected from the group consisting of a: maleimidyl group, iodoacetyl group, disulfide 
derivative and any other thermostable link (conducting to a stable protein-protein interaction 
or protein-molecule interaction). 

In a preferred embodiment, the catalytic domain may be the catalytic domain of an 
enzyme selected from the group consisting of: a polymerase, an alpha-amylase (substrate 
such as starch), a lipase (substrate such as ester), a protease (modified or not modified peptide 
or polypeptide as substrate), a cyclodextringlycosyltransferase, and an adenylate cyclase. 

In another embodiment, the assessment of the catalytic activity of step (f) is made by 
means of a DNA polymerization. 

In yet another embodiment of the present invention, step (b) may be performed after 
(e) of cross-linking or during (h) of assesing catalytic activity. 

As a general method for the isolation of thermostable enzymes and their genes the 
following should be noted: 
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First, the gene encoding variants of a catalytic domain are fused to the gene encoding 
a phage coat protein (such as filamentous phage g3, g6, g7, g9 or g8 protein or of other 
phage/virus particles) either directly or using a peptide or polypeptide linker such as a short 
peptide sequence or a protein or a protein domain. These genes encoding phage coat proteins 
may be fused either at the 3' or at the 5' terminus depending on whether the N- or the C- 
termini of the proteins are located on the outside of the particle. 

This is done either using a phage vector or a phagemid vector used with a helper 

phage. 

Second, the phage- variant enzymes may be heated at a preferred temperature of 65*C 
for 1 minute or for several hours as appropriate. This step can be performed before or after 
the substrate cross-linking (maleimidyl group derivatised substrate (DNA primer) crosslinked 
to the phage particle) and catalysis (DNA polymerisation) steps. Catalysis is preferably at 
65°C for 2 minutes, but can be done at any temperature between 0°C and 100°C. 
Crosslinking is typically performed for 2 hours at 37°C, but can be done at other temperatures 
(higher temperature may increase maleimidyl hydrolysis versus maleimidyl phage cross- 
linking). 

It is worth noting that the link between the gene and the corresponding enzyme 
variants is unaltered by high temperatures and the phage particle are still infective and the 
genes selected can be amplified hyE. colt after infection (cf for example, Kristensen P, 
Winter G. Proteolytic selection for protein folding using filamentous bacteriophages. Fold 
Des. 1998;3(5):321-8) 

By way of example of the aforementioned embodiments, the present invention relates 
to a purified, thermostable DNA polymerase purified from TTiermus aquaticus and 
recombinant means for producing the enzyme. Thennostable DNA polymerases are usefiil in 
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many recombinant DNA techniques, especially nucleic acid amplification by the polymerase 
chain reaction (PCR) 

Directed protein-evolution strategies generally make use of a link between a protein 
and the encoding DNA. In phage-display technology, this link is provided by fusion of the 
protein with a coat-protein that is incorporated into the phage particle containing the DNA. 
Optimization of this link can be achieved by adjusting the signal sequence of the fusion. 

Linking of a gene to its corresponding polypeptide is a central step in directed protein 
evolution toward new functions. Filamentous bacteriophage particles have been extensively 
used to establish this linkage between a gene of interest and its protein expressed as a fusion 
product with a phage coat protein for incorporation into the phage particle. Libraries of 
proteins displayed on phage can be subjected to in vitro selection to isolate proteins with 
desired properties together with their genes. 

Creating a link between a gene and a single corresponding protein was achieved by 
making use of a phagemid for expression of the fusion protein and of a helper phage for 
assembly of the phage particles. This approach, yielding a monovalent display of protein, was 
found to be essential to avoid avidity effects or chelate effects, which introduce strong biases 
during in vitro selections for affinity. However, it also produces phage particles that do not 
display any protein of interest and which thereby represent a background in evolution 
experiments. 

To optimize the link between a gene and a single corresponding protein, several 
methods have been used. For example, the periplasmic factor Skp was found to improve the 
display of single-chain Fv antibodies on filamentous phage (Bothmann, H. and Pluckthun, A. 
(1998) Selection for a periplasmic factor improving phage display and functional periplasmic 
expression. Nat. Biotech. 16, 376-380.). In a previous study, the present inventors showed 
that specific signal sequences for optimal display on phage of the Taq DNA polymerase I 
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Stoffel fragment can be isolated from a library of more than 10^ signal sequences derived 
from pelB (Jestin, J.L., Volioti, G. and Winter, G. (2001). Improving the display of proteins 
on filamentous phage. Res. Microbiol. 152, 187-191). Signal sequences, once translated, are 
recognized by the bacterial protein export machinery. The polypeptide is then exported in the 
bacterial periplasm before cleavage of the signal peptide by the signal peptidase, thereby 
releasing the mature protein, 

A short sequence, m (SG4CG4; SEQ ID NO: 39), at the C-terminus of the signal 
sequence, was initially introduced as a potential cross-linking site of substrates on phage that 
may be usefiil for selections by catalytic activity. This glycine-rich sequence may also be 
important for preventing structure formation at the peptidase cleavage site or for defining two 
independently folding units in the pre-protein. The glycine-rich sequence may then improve 
the signal sequence processing and finally lead to a greater ratio of protein fusions on phage. 
The present inventors, therefore, evaluated the effect of a selected signal sequence on the 
display of proteins on phage, as well as the effect of the m sequence at the C-terminus of the 
signal peptide. 

In an embodiment of the present invention is a method of identifying thermostable 
mutant polymerases derived from the Stoffel fragment of Taq comprising 

a) packaging a vector in which a polynucleotide encoding a phage coat protein is 
fiised to a polynucleotide encoding a protein having at least 80% identity to SEQ ID NO: 26 
into a phage 

b) expressing the fusion protein; 

c) isolation (selection) of phage particles; 

d) infecting E. coli and incubating the infected E, coli; 

e) detecting the fusion protein; 

f) assessing polymerase activity. 
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In this method, evolutionarily advantageous mutants may be identified by repeating 
steps (b) - (f) 0 to 25 times, preferably 0-20 times, more preferably 1-15 times, a most 
preferably 2 to 10 times. The method comprising one cycle (repeated 0 times) is particularly 
adapted to high throughput screening, when steps are repeated from 3 to 7 times, the method 
is better adapted for classical emprirical screening. 

In a preferred embodiment, the phage coat protein has a sequence of SEQ ID NO: 39. 

By way of example. Applicants provide the following exemplary discussion of the 
phage-display method of the present invention and refer to Strobel et al, Molec. Biotech. 
2003, vol. 24, pp. 1-9, which is incorporated herein by reference in its entirety: 
The amino acid signal sequences are: 

pelB: MKYLLPTAAAGLLLLAAQPAMA (SEQ ID NO- 41)- 
17: MKTLLAMVLVGLLLLPPGPSMA (SEQ ID NO: 42); ' 
110: MRGLLAMLVAGLLLLPIAPAMA (SEQ ID NO: 43); and 
112: MRRLLVIAAGLLLLLAPPTMA (SEQ ID NO: 44). 

The present inventors goal was to increase the display of proteins at the surface of 
filamentous phages. As model proteins, the present inventors chose the catalytic domains of 
adenylate cyclases from E. coli (ACE) and from B. pertussis (ACS). The present inventors 
also examined the display of two different enzymes, an adenylate cyclase and the Stoffel 
fragment of Taq DNA polymerase I, incorporated into phage particles as single polypeptide 
fusion products with minor coat protein p3. In this work, the present inventors evaluated the 
effects of two signal peptides (pelB and 17) and of the short peptide (m; SEQ ID NO: 39) at 
the N-tenninus of the fiision of these enzymes with p3. One other construct, deriving from 
the selected signal peptide 1 12, is also mentioned here, and the data are summarized together 
with previously published data for the selected signal sequences 1 10 and 1 12 (2). 

The phage particles were produced by using a helper phage, KM13 (6), for assembly 
of the particles, and by using phagemids pHENl (5), pHEN117, and pHEN1112 (2) encoding 
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the p3 fusion proteins. These phagemid vectors differ in their signal sequence: pelB is from 
Erwinia caratovora pectate lyase B (7), whereas signal sequences 17, 110, and 112, were 
selected from a library of more than lO' signal sequences for optimal display of the Stoffel 
fragment on filamentous phage (2). For all 17 phagemids encoding the different fusion 
proteins described in this work, the present inventors observed standard titers of infective 
particles, which were all in the range of 1.4 x lo'°-7.8xlo"' phages/mL of culture medium. 
Furthermore, enzymatic activities were detected for all phage-cyclase particles by thin layer 
chromatography and by HPLC (data not shown). 

The efficiency of protein display on phage was evaluated through two approaches. 
The first makes use of the engineered helper phage KM13 (6) to measure the fraction of 
infective phage particles that display a fusion product. The p3 fusion protein provided by the 
phagemid and the p3 protein provided by the helper phage compete for incorporation into the 
phage particles. The helper phage p3 is engineered so as to contain a protease cleavage site 
between domains 2 and 3 of p3. In phage particles that contain only helper p3 copies, no full 
p3 copy is available for bacterial infection after protease treatment: the phage particles are 
noninfective. If a phage particle has incorporated a p3 fusion protein, one copy of the three- 
p3 domains remains after protease cleavage, and is sufficient for infection of ^. coli. The 
trypsin-resistant fraction of phage is therefore a measure of protein display on infective 
phages. With this method, the display of fusion proteins was found to vary over more than 
two orders of magnitude for each cyclase, depending on the signal sequence and on 
neighboring sequences. Among the phagemid vectors containing the selected signal sequence 
17, three of the four fusion proteins that the present inventors studied (AC-p3 and AC- 
Sto£fel-p3, where AC is the adenylate cyclase catalytic domain of ^. coli or B .pertussis) 
were remarkably well incorporated into phage particles: more than one phage particle out of 
ten displayed an enzyme. No more than one particle in 300 displayed the E. coli cyclase fused 
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to the Stoffel ftagment and to protein 3, and better display of this protein could not be found 
among the constructs tested. 

The peptide w, SG4CG4, at the iV-terminus of the mature fusion protein, was found to 
increase the display of 5. pertussis cyclase-polymerase fusion on phage, by 100-fold for 
signal sequence 17 and by 10-fold for pelB. For this fusion, the worst display ratios are 
significantly improved with peptide m. Display ofB. pertussis cyclase on phage was high in 
all cases, such that a marginal improvement due to the m peptide was found for signal 
sequence 17, and improvement within the limits of experimental error for pelB. Concerning 
the E. coli cyclase protein, peptide m decreases the latter's display by a factor of 30 to 40. For 
the E. coli cyclase-polymerase fusion, peptide m showed no significant effect with the signal 
sequence pelB and a small improvement with signal sequence 17. 

Significant effects of the signal sequence on phage display were detected for three of 
the four fusions in the present inventors' study: from 5- to about 20-fold improvements in 
display on phage were noted for substitution of pelB by signal sequence 17. In the case of the 
B. pertussis cyclase-p3 fusion protein, incorporation of the fusion protein into phage particles 
was high, whether the signal sequence was pelB, 17. or 1 12. Indeed, for the selected signal 
sequence 112, up to 40% of infective phage particles displayed an enzyme at the surface of 
filamentous phage. 

When two enzymes were simultaneously displayed on phage (either E. coli or B. 
pertussis adenylate cyclase and the Stoffel fragment polymerase), the present inventors noted 
that the incorporation of p3 fusion products was significantly reduced in most cases. 
Remarkably, about half of the infective phage particles displayed a B. pertussis adenylate 
cyclase-Stoffel fragment polymerase-i)3 protein fusion when the selected signal sequence 17 
and the short N-terminal peptide m were present in the construct. 
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The second approach to estimating the level of fusion proteins incorporated into 
phage particles relies on the detection of p3 domain 3 by a monoclonal antibody (8) after 
SDS-PAGE and Western blotting of denatured phage particles. These results are in 
accordance with the data the present inventors obtained by measuring the trypsin-resistant 
ftaction of infective phages. All fusion products expressed on phage and which correspond to 
a trypsin-resistant fraction of phage higher than 0. 1 are indeed observed by Westem blot 
analysis. 

The present inventors aim to direct the evolution of adenylate cyclases by in vitro 
selection using a chemistry involving filamentous phage. This should provide a tool for the 
engineering of adenylate cyclases as well as a strategy for the fimctional cloning of this class 
of enzymes. Recent in vitro selection methods for catalytic activity using phage display have 
been designed as affinity chromatography methods for the reaction product linked to the 
phageenzyme that catalyzed the reaction from substrate to product. These selection methods 
were established with enzymes such as nuclease (9), DNA polymerase (10), peptidase 
(1 1,12), peptide Hgase (13), and beta-lactamase (14). They require an efficient display of 
enzyme on phage and a method to link the substrate/product to phage-enzymes. 

In the work reported here, the present inventors investigated the display of adenylate 
cyclases from B. pertussis and from E. coli on filamentous phage, and the display of two 
independent enzymes, an adenylate cyclase and the Taq DNA polymerase I Stoffel fragment. 
The Stoffel fragment (15) could be used as a tool to establish an in vitro selection for cyclase 
activity as follows: the polymerase domain may serve as an anchor of the substrate ATP on 
phage through double-stranded DNA used as a linker with a high affinity for the fiision 
protein. Another approach to cross-linking substrate and phage involves introduction of the 
thiol group of a cysteine residue within peptide m (SG4CG4), at the N-temiinus of the mature 
fiision protein and at the C-terminus of the fiision protein's signal sequence (10). 
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The signal sequences 17, 1 10, and 1 12, used in the present inventors' study had been 
selected from large libraries of pelB mutants for optimal display of the Stoffel fragment-p3 
protein fused to the peptide m (2). It was therefore important to further investigate which 
sequence context was essential for selection of these signal sequences, either the short peptide 
m or the entire gene. Interestingly, the present inventors found that the presence or the 
absence of this short peptide, SG4CG4, can yield up to 100-fold increases in the display of a 
fusion protein on filamentous phage. This strong effect was observed for the B. pertussis 
cyclase-Stofrel-p3 fusion as well as for the E. coli cyclase-p3 fusion in the case of the signal 
sequence 17 (Table 2). Of further note is that the signal sequences 17 and 1 12, yield generally 
better levels of protein display on phage than does pelB (Fig. 3). This improved display of 
proteins might be ascribed to the different targeting modes of the signal sequences. These 
selected signal sequences that improve the display of proteins on phage should therefore be 
useful in other systems. 

Our study highlights the important effects of the signal sequence and of a short 
peptide at the C-terminus of the signal sequence on the display of proteins on phage. Apart 
from the previously stated conclusions that the selected signal sequence 17 often yields an 
improved display as compared with pelB, and that sequence m can have drastic effects on the 
level of protein display, the set of protein fusions described here is not sufficient to define 
any further rules about sequences and optimal display of proteins on phage. Indeed, 
incorporation of a fusion protein into a phage particle is the result of a complex sequence of 
events involving fiision gene transcription and translation, folding, and export of the fusion 
protein, as well as cleavage of the signal sequence. 

Two approaches, however, can be envisaged for efficient display of proteins on 
bacteriophage. First, directed signal peptide evolution experiments can be undertaken for any 
defined protein so as to isolate a signal sequence for optimal display on phage. This approach 
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was described previously in the case of the Stoffel fragment of Taq DNA polymerase I (2). A 
more straightforward and quicker approach consists of the screening of several phagemid 
vectors that differ in their signal sequences and, more generally, in their regulatory 
sequences. In this report the present inventors have shown that for three of the four fusion 
proteins tested, excellent cyclase display levels can be obtained: more than one phage in ten 
displays an enzyme. Such display levels for large proteins should be useful for further 
approaches to directed protein evolution. 

With use of the phagemid strategy, almost every particle expresses a p3 copy 
provided by the phagemid if no gene fusion has been engineered or if the insert from the gene 
fusion has been deleted. On the contrary, about one phage particle in a thousand incorporates 
large fusion proteins such as cyclase-Stoffel fragment-p3 fusions. This indicates that for an 
equal mixture of two genes, thousand-fold differences in expression of the corresponding 
proteins on phage particles can be obtained. This bias may be of no importance if enrichment 
factors per selection round are much larger than 10\ but it may otherwise significantly alter 
the outcome of evolution experiments. Similar protein expression levels on phage of different 
genes would be useful to minimize biases introduced by successive amplifications in 
evolution experiments. The use of sets of phagemid vectors that differ by their signal 
sequences and by neighboring sequences might be of interest for better representation of 
protein libraries on filamentous phage. Additionally, the display of two distinct enzymes on 
single phage particles might be usefiil to direct their coevolution, especially in the case of two 
enzymes involved in the same metabolic pathway with an unstable reaction intermediate. 

By insertion or by deletion of the short peptide sequence SG4CG4 (m; SEQ ID NO: 
39) at the C-terminus of the signal sequence, the present inventors have shown that two 
enzymes can be very efficiently expressed as single polypeptides on the surface of 
filamentous bacteriophage by using the phagemid strategy. The model proteins described in 
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this Study are the catalytic domains of adenylate cyclases of B. pertussis or of E. coli, fused or 
not fused to the Stoffel-fragment DNA polymerase. 

On average, the present inventors found the best display levels for the selected signal 
sequence 17, which had been previously selected from a large library for optimal display on 
phage of the Stoffel fragment, and not for the commonly used signal sequence pelB. Yet the 
present inventors observed striking differences in display levels of these enzymes on the 
surfaces of phage particles, depending on the short N-terminal peptide m. The findings 
reported here should be useful for the display of large and of cytoplasmic proteins on 
filamentous phage particles, and more generally for protein engineering using phage display. 

The term "thermostable" enzyme refers to an enzyme that is stable over a temperature 
range of approximately 55°C to 105°C. In particular, thermostable enzymes in accordance 
with the present invention are heat resistant and catalyze the template directed DNA 
synthesis. Preferably, the activity of the thermostable enzymes of the present is at least 50% 
of activity, preferably at least 75%, more preferably at least 85%, of the wild-type enzyme 
activity over the same temperature range. In a particularly preferred embodiment, the 
thermostable enzyme of the present invention exhibits at least 50% of activity, preferably at 
least 75%, more preferably at least 85%, of the wild-type enzyme activity when said wild- 
type enzyme activity is measured under optimal conditions. Moreover, it is preferable that the 
"thermostable" enzyme does not become irreversibly denatured when subjected to the 
elevated temperatures and incubation time for denaturation of double-stranded nucleic acids, 
as well as the repetitive cycling between denaturation, annealing, and extension inherent to 
PCR-based techniques. 

As used herein, the term "reduced" or "inhibited" means decreasing the activity of 
one or more enzymes either directly or indirectly. The definition of these tenns also includes 
the reduction of the in vitro activity, either directly or indirectiy, of one or more enzymes. 
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The term "enhanced" as used herein means increasing the activity or concentration 
one or more polypeptides, which are encoded by the corresponding DNA. Enhancement can 
be achieved with the aid of various manipulations of the bacterial cell, including mutation of 
the protein, replacement of the expression regulatory sequence, etc. 

In order to achieve enhancement, particularly over-expression, the number of copies 
of the corresponding gene can be increased, a strong promoter can be "operably linked," or 
the promoter- and regulation region or the ribosome binding site which is situated upstream 
of the structural gene can be mutated. In this regard, the term "operably linked" refers to the 
positioning of the coding sequence such that a promoter, regulator, and/or control sequence 
will function to direct the expression of the protein encoded by the coding sequence located 
downstream therefrom. 

Expression cassettes that are incorporated upstream of the structural gene act in the 
same manner. In addition, it is possible to increase expression by employing inducible 
promoters. A gene can also be used which encodes a corresponding enzyme with a high 
activity. Expression can also be improved by measures for extending the life of the mRNA. 
Furthermore, preventing the degradation of the enzyme increases activity as a whole. 
Moreover, these measures can optionally be combined in any desired manner. These and 
other methods for altering gene activity in a plant are known as described, for example, in 
Methods in Plant Molecular Biology, Maliga et al, Eds., Cold Spring Harbor Laboratory 
Press, New York (1995). The definition of these terms also includes the enhancement of the 
in vitro activity, either directly or indirectly, of one or more enzymes. 

A gene (polynucleotide) can also be used which encodes a corresponding or variant 
polymerase having at least 80% identity to SEQ ID NO: 26. These gene(polynucleotides) 
can have various mutations. For example, a mutation of one or more amino acids in amino 
acids 738 to 767 of SEQ ID NO:26. Further examples of mutations include mutations at 
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positions M470, F472, M484, and W550 A331, and S335. In a preferred embodiment, these 
mutations are ASS IT, SS35N, M470K, M470R, F472Y, M484V, M484T, and W550R. In a 
particularly preferred embodiment, the polynucleotides of the present invention encode 
polypeptides having one or more of the aforementioned mutations and share at least 85% 
identity, at least 90% identity, at least 95% identity, or at least 97.5% identity to the 
polypeptide of SEQ ID NO: 26. Moreover, polynucleotides of the present invention encode 
polypeptides that have DNA polymerase activity and/or 5 '-3' exonuclease activity. More 
particularly, the polynucleotides of the present invention encode polypeptides that are capable 
of catalyzing the reverse transcription of mRNA. 

In the present invention, the polynucleotide may encode a polypeptide contain at least 
one mutation at a position selected from the group consisting of A331, L332, D333, Y334, 
and SS35. The polynucleotide may encode a polypeptide of the present invention which has 
amino acid sequence selected from the group consisting of SEQ ID NO: 20, SEQ ID NO: 22, 
SEQ ID NO: 24, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32. SEQ ID NO: 34, SEQ 
ID NO: 36, and SEQ ID NO: 38. 

Within the context of the present application, the preferred polynucleotides possess a 
polynucleotide sequence selected from the group consisting of SEQ ID NO: 19, SEQ ID NO: 
21, SEQ ID NO: 23, SEQ ID NO: 27, SEQ ID NO: 29, SEQ ID NO: 31, SEQ ID NO: 33, 
SEQ ID NO: 35, and SEQ ID NO: 37. 

Within the scope of the present invention are also polynucleotides that are 
homologous to the aforementioned sequences. In the context of the present application, a 
polynucleotide sequence is "homologous" with the sequence according to the invention if at 
least 80%, preferably at least 90%, more preferably 95%, and most preferably 97.5% of its 
base composition and base sequence corresponds to the sequence according to the invention. 
It is to be understood that, as evinced by the Examples of the present invention and the 
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. phage-display method highlighted herein, screening of theoretical mutations within the scope 
of the present invention would require nothing more than a technician's level of skill in the 
art. More specifically, as is routine in the art, with the identification of a candidate sequence 
the artisan would assay and screen one or all possible permutations of the said sequence to 
identify mutants possessing the same or better DNA polymerase activity, reverse 
transcriptase activity, and/or 5 '-3' exonuclease activity. 

The expression "homologous amino acids" denotes those that have corresponding 
properties, particularly with regard to their charge, hydrophobic character, steric properties, 
etc. 

Homology, sequence similarity or sequence identity of nucleotide or amino acid 
sequences may be determined conventionally by using known software or computer 
programs such as the BestFit or Gap pairwise comparison programs (GCG Wisconsin 
Package, Genetics Computer Group, 575 Science Drive, Madison, Wisconsin 5371 1). BestFit 
uses the local homology algorithm of Smith and Watennan, Advances in Applied 
Mathematics 2: 482-489 (1981), to find the best segment of identity or similarity between 
two sequences. Gap performs global alignments: all of one sequence with all of another 
similar sequence using the method of Needleman and Wunsch, J. Mol. Biol. 48:443-453 
(1970). When using a sequence alignment program such as BestFit, to determine the degree 
of sequence homology, similarity or identity, the default setting may be used, or an 
appropriate scoring matrix may be selected to optimize identity, similarity or homology 
scores. Similarly, when using a program such as BestFit to determine sequence identity, 
similarity or homology between two different amino acid sequences, the default settings may 
be used, or an appropriate scoring matrix, such as blosum45 or blosumSO, may be selected to 
optimize identity, similarity or homology scores. 

The terms "isolated" or "purified" means separated fi-om its natural environment. 
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The term "polynucleotide" refers in general to polyribonucleotides and 
polydeoxyribonucleotides, and can denote an unmodified RNA or DNA or a modified RNA 
orDNA. 

The term "polypeptides" is to be understood to mean peptides or proteins that contain 
two or more amino acids that are bound via peptide bonds. A "polypeptide" as used herein is 
understood to mean a sequence of several amino acid residues linked by peptide bonds. Such 
amino acids are known in the art and encompass the unmodified and modified amino acids, 
hi addition, one or more modifications known in the art such as glycosylation, 
phosphorylation, etc may modify the polypeptide. 

The term "homologous" as used herein is understood to mean two or more proteins 
fi-om the same species or fi-om a different species. Within the meaning of this term, said two 
or more polypeptides share at least 80% identity to the polypeptide of SEQ E) NO: 26 and 
can have the mutations discussed herein. In a particularly preferred embodiment, the 
polypeptides of the present invention have one or more of the aforementioned mutations and 
share at least 85% identity, at least 90% identity, at least 95% identity, or at least 97.5% 
identity to the polypeptide of SEQ ID NO: 26. Moreover, the polypeptides of the present 
invention have DNA polymerase activity and/or 5 '-3' exonuclease activity. More 
particularly, the polypeptides of the present invention are capable of catalyzing the reverse 
transcription of mRNA. 

In the present invention, the polypeptide may contain one or more mutations, such as 
A331, L332, D333, Y334, and S335. The isolated polypeptide of the present invention has 
an amino acid sequence selected fi-om the group consisting of SEQ ID NO: 20, SEQ ID NO: 
22, SEQ ID NO: 24, SEQ ID NO: 28, SEQ ID NO: 30, SEQ ID NO: 32, SEQ ID NO: 34, 
SEQ ID NO: 36, and SEQ ID NO: 38. 
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In an embodiment of the present invention are mutations concerning alanine in 
position 331 (A331), and serine in position 335 (S335) that may have particular importance 
derived ftom the fact that they are surrounding the aspartic acid D in position 333 which is 
responsible for the chelation of Mn^* or Mg^^ . Thus, in one embodiment of the present 
invention, mutations of one or more amino acids 10 amino acids upstream and/or 10 amino 
acids downstream of this site are provided. 

The expression "homologous amino acids" denotes those that have corresponding 
properties, particularly with regard to their charge, hydrophobic character, steric properties, 
etc. 

Moreover, one skilled in the art is also aware of conservative amino acid replacements 
such as the replacement of glycine by alanine or of aspartic acid by glutamic acid in proteins 
as "sense mutations" which do not result in any fundamental change in the activity of the 
protein, i.e. which are functionally neutral. It is also known that changes at the N- and/or C- 
terminus of a protein do not substantially impair the fiinction thereof, and may even stabilize 
said function. As such, these conservative amino acid replacements are also envisaged as 
being within the scope of the present invention. 

The present invention also relates to DNA sequences that hybridize with the DNA 
sequence that encodes a corresponding or variant polymerase having at least 80% homology 
to SEQ ID NO: 26, the polypeptides having the mutations described herein. The present 
invention also relates to DNA sequences that are produced by polymerase chain reaction 
(PGR) using oligonucleotide primers that result from the DNA sequence that encodes a 
corresponding or variant polymerase having at least 80% homology to SEQ ID NO: 26, 
wherein the polypeptide has at least one mutation as described herein, or fragments thereof 
Oligonucleotides of this type typically have a length of at least 1 5 nucleotides. 



26 



Docket Number 248628US0X 



The terms "stringent conditions" or "stringent hybridization conditions" includes 
reference to conditions under which a polynucleotide will hybridize to its target sequence, to 
a detectably greater degree than other sequences (e.g., at least 2-fold over background). As 
used herein, stringent hybridization conditions are those conditions which allow hybridization 
between polynucleotides that are 80%, 85%, 90%, 95%, or 97.5% homologous as determined 
using conventional homology programs, an example of which is UWGCG sequence analysis 
program available from the University of Wisconsin. (Devereaux et al. Nucl. Acids Res. 12: 
387-397 (1984)). Stringent conditions are sequence-dependent and will be different in 
different circumstances. By controlling the stringency of the hybridization and/or washing 
conditions, target sequences can be identified which are 100% complementary to the probe 
(homologous probing). Alternatively, stringency conditions can be adjusted to allow some 
mismatching in sequences so that lower degrees of similarity are detected (heterologous 
probing). 

Typically, stringent conditions will be those in which the salt concentration is less 
than about 1.5 M Na ion, typically about 0.01 to 1.0 M Na ion concentration (or other salts) at 
pH 7.0 to 8.3 and the temperature is at least about 30 "C for short probes (e.g., 1 0 to 50 
nucleotides) and at least about 60 "C for long probes (e.g.. greater than 50 nucleotides). 
Stringent conditions may also be achieved with the addition of destabilizing agents such as 
formamide. Exemplary low stringency conditions include hybridization with a buffer solution 
of 30 to 35% formamide. 1 M NaCl. 1% SDS (sodium dodecyl sulfate) at 37 »C, and a wash 
in IX to 2X SSC (20X SSC=3.0 M NaCl/0.3 M trisodium citrate) at 50 to 55 <>C Exemplary 
moderate stringency conditions include hybridization in 40 to 45% formamide, 1 M NaCl. 
1% SDS at 37 »C, and a wash in 0.5X to IX SSC at 55 to 60 °C. Exemplary high stringency 
conditions include hybridization in 50% formamide, 1 M NaCl, 1% SDS at 37 •'C. and a wash 
in 0.1 X SSC at 60 to 65 »C. 
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Specificity is typically the function of post-hybridization washes, the critical factors 
being the ionic strength and temperature of the final wash solution. For DNA~DNA hybrids, 
the Tm can be approximated fiiom the equation of Meinkoth and Wahl, Anal. Biochem., 
138:267-284 (1984): Tm =81.5°C +16.6 (log M)+0.41 (%GC)-0.61 (% form)-500/L; where 
M is the molarity of monovalent cations, %GC is the percentage of guanosine and cytosine 
nucleotides in the DNA, % form is the percentage of formamide in the hybridization solution, 
and L is the length of the hybrid in base pairs. The Tm is the temperature (under defined ionic 
strength and pH) at which 50% of a complementary target sequence hybridizes to a perfectly 
matched probe. Tm is reduced by about 1 °C for each 1% of mismatching; thus, Tm, 
hybridization and/or wash conditions can be adjusted to hybridize to sequences of the desired 
identity. For example, if sequences with approximately 90% identity are sought, the Tm can 
be decreased 10»C. Generally, stringent conditions are selected to be about 5 "C lower than 
the thermal melting point (Tm) for the specific sequence and its complement at a defined 
ionic strength and pH. However, severely stringent conditions can utilize a hybridization 
and/or wash at 1, 2, 3, or 4 »C lower than the thermal melting point (Tm); moderately 
stringent conditions can utilize a hybridization and/or wash at 6, 7, 8, 9, or 10°C lower than 
the thermal melting point (Tm); low stringency conditions can utilize a hybridization and/or 
wash at 11, 12, 13. 14, 15, or 20 °C lower than the thermal melting point (Tm). Using the 
equation, hybridization and wash compositions, and desired Tm, those of ordinary skill will 
understand that variations in the stringency of hybridization and/or wash solutions are 
inherently described. If the desired degree of mismatching results in a Tm of less than 45 »C 
(aqueous solution) or 32 "C (formamide solution) it is preferred to increase the SSC 
concentration so that a higher temperature can be used. An extensive guide to the 
hybridization of nucleic acids is found in Current Protocols in Molecular Biology, Chapter 2, 
Ausubel, et al., Eds., Greene Publishing and Wiley-Interscience, New York (2000). 
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Thus, with the foregoing information, the skilled artisan can identify and isolated 
polynucleotides, which are substantially similar to the present polynucleotides. In isolating 
such a polynucleotide, the polynucleotide can be used as the present polynucleotide in, for 
example, to express a polypeptide having DNA polymerase activity and 5 '-3' exonuclease 
activity. 

One embodiment of the present invention is methods of screening for polynucleotides, 
which have substantial homology to the polynucleotides of the present invention, preferably 
those polynucleotides encoding a polypeptide having DNA polymerase activity and/or 5 '-3' 
exonuclease activity. 

The polynucleotide sequences of the present invention can be carried on one or more 
suitable plasmid vectors, as known in the art for bacteria or the like. 

Host cells usefiil in the present invention include any cell having the capacity to be 
infected or transfected by phages or vectors comprising the polynucleotide sequences 
encoding the enzymes described herein and, preferably also express the thermostable 
enzymes as described herein. Suitable host cells for expression include prokaryotes, yeast, 
archae, and other eukaryotic cells. Appropriate cloning and expression vectors for use with 
bacterial, fungal, yeast, and mammalian cellular hosts are well known in the art, e.g.. Pouwels 
et al. Cloning Vectors: A Laboratory Manual, Elsevier, New York (1985). The vector may 
be a plasmid vector, a single or double-stranded phage vector, or a single or double-stranded 
RNA or DNA viral vector. Such vectors may be introduced into cells as polynucleotides, 
preferably DNA, by well known techniques for introducing DNA and RNA into cells. The 
vectors, in the case of phage and viral vectors also may be and preferably are introduced into 
cells as packaged or encapsulated virus by well known techniques for infection and 
transduction. Viral vectors may be replication competent or replication defective, hi the 
latter case viral propagation generally will occur only in complementing host cells. Cell-free 
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translation systems could also be employed to produce the enzymes using RNAs derived 

from the present DNA constructs. 

Prokaiyotes useful as host cells in the present invention include gram negative or 

gram positive organisms such as E. coli or Bacilli, hi a prokaryotic host cell, a polypeptide 

may include a N-terminal methionine residue to facilitate expression of the recombinant 
polypeptide in the prokaryotic host cell. The N-terminal Met may be cleaved from the 
expressed recombinant polypeptide. Promoter sequences commonly used for recombinant 
prokaryotic host cell expression vectors include jS-lactamase and the lactose promoter system. 

Expression vectors for use in prokaryotic host cells generally comprise one or more 
phenotypic selectable marker genes. A phenotypic selectable marker gene is, for example, a 
gene encoding a protein that confers antibiotic resistance or that supplies an autotrophic 
requirement. Examples of useful expression vectors for prokaryotic host cells include those 
derived from commercially available plasmids such as the cloning vector pBR322 (ATCC 
37017). pBR322 contains genes for ampicillin and tetracycline resistance and thus provides 
simple means for identifying transformed cells. To construct an expression vector using 
pBR322, an appropriate promoter and a DNA sequence are inserted into the pBR322 vector. 

Other commercially available vectors include, for example, pKK223-3 (Pharmacia 
Fine Chemicals, Uppsala, Sweden) and pGEMl (Promega Biotec, Madison, Wisconsin., 
USA). 

Promoter sequences commonly used for recombinant prokaryotic host cell expression 
vectors include ^-lactamase (penicillinase), lactose promoter system (Chang et al. 
Nature275:615, (1978); and Goeddel et al.. Nature 281:544, (1979)), fryptophan (tip) 
promoter system (Goeddel et al.. Nucl. Acids Res. 8:4057, (1980)), and tac promoter 
(Maniatis, Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory, p. 412 
(1982)). 

30 



Docket Number 248628US0X 



Yeasts useful as host cells in the present invention include those from the genus 
Saccharomyces, Pichia, K. Actinomycetes and Kluyveromyces. Yeast vectors will often 
contain an origin of replication sequence from a 2/i yeast plasmid, an autonomously 
replicating sequence (ARS), a promoter region, sequences for polyadenylation, sequences for 
transcription termination, and a selectable marker gene. Suitable promoter sequences for 
yeast vectors include, among others, promoters for metallothionein, 3-phosphoglycerate 
kinase (Hitzeman et al., J. Biol. Chem. 255:2073, (1980)) or other glycolytic enzymes 
(Holland etal., Biochem. 17:4900, (1978)) such as enolase, glyceraldehyde-3 -phosphate 
dehydrogenase, hexokinase, pyruvatee decarboxylase, phosphofructokinase, glucose-6- 
phosphate isomerase, 3-phosphoglycerate mutase, pyruvate kinase, triosephosphate 
isomerase, phosphoglucose isomerase, and glucokinase. Other suitable vectors and 
promoters for use in yeast expression are further described in Fleer et al.. Gene, 107:285-195 
(1991). Other suitable promoters and vectors for yeast and yeast transformation protocols are 
well known in the art. 

Yeast transformation protocols are known to those of skill in the art. One such 
protocol is described by Hinnen et al.. Proceedings of the National Academy of Sciences 
USA. 75:1929 (1978). The Hinnen protocol selects for Trp.sup.+ transformants in a selective 
medium, wherein the selective medium consists of 0.67% yeast nitrogen base, 0.5% casamino 
acids, 2% glucose, 10 /ig/ml adenine, and 20 /ig/ml uracil. 

Mammalian or insect host cell culture systems well known in the art could also be 
employed to express recombinant polypeptides, e.g.. Baculovirus systems for production of 
heterologous proteins in insect cells (Luckow and Summers, Bio/Technology 6:47 (1988)) or 
Chinese hamster ovary (CHO) cells for mammalian expression may be used. Transcriptional 
and translational control sequences for mammalian host cell expression vectors may be 
excised from viral genomes. Commonly used promoter sequences and enhancer sequences 
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are derived from Polyoma virus, Adenovirus 2, Simian Virus 40 (SV40), and human 
cytomegalovirus. DNA sequences derived from the S V40 viral genome may be used to 
provide other genetic elements for expression of a structural gene sequence in a mammalian 
host cell, e.g., SV40 origin, early and late promoter, enhancer, splice, and polyadenylation 
sites. Viral early and late promoters are particularly useful because both are easily obtained 
from a viral genome as a fragment which may also contain a viral origin of replication. 
Exemplary expression vectors for use in mammalian host cells are well known in the art. 

The enzymes of the present invention may, when beneficial, be expressed as a fusion 
protein that has the enzyme attached to a fusion segment. The fusion segment often aids in 
protein purification, e.g., by pennitting the fusion protein to be isolated and purified by 
affinity chromatography. Fusion proteins can be produced by culturing a recombinant cell 
fransformed with a fiision nucleic acid sequence that encodes a protein including the fusion 
segment attached to either the carboxyl and/or amino terminal end of the enzyme. 

In one embodiment, it may be advantageous for propagating the polynucleotide to carry 
it in a bacterial or fimgal strain with the appropriate vector suitable for the cell type. 
Common methods of propagating polynucleotides and producing proteins in these cell types 
are known in the art and are described, for example, in Maniatis et al.. Molecular Cloning: A 
Laboratory Manual, Cold Spring Harbor Laboratory Press, New York (1982) and Sambrook 
et al., Molecular Cloning: A Laboratory Manual, Cold Spring Harbor Laboratory Press, New 
York (1989). 

In one embodiment of the present invention are monoclonal phages: 

1. SJL q deposited as CNCM 1-3168 in the Collection Nationale de Cultures de 
Microorganismes (CNCM) on February 27, 2004. 

2. SJL d deposited as CNCM 1-3169 in the Collection Nationale de Cultures de 
Microorganismes (CNCM) on February 27, 2004. 
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3. SJL I deposited as CNCM 1-3 1 70 in the Collection Nationale de Cultures de 
Microorganismes (CNCM) on February 27, 2004. 

4. SJL s deposited as CNCM 1-3171 in the Collection Nationale de Cultures de 
Microorganismes (CNCM) on February 27, 2004. 

5. SJL b deposited as CNCM 1-3172 in the Collection Nationale de Cultures de 
Microorganismes (CNCM) on February 27, 2004. 

6. SJL n deposited as CNCM 1-3173 in the Collection Nationale de Cultures de 
Microorganismes (CNCM) on February 27, 2004. 

7. SJL g deposited as CNCM 1-3174 in the Collection Nationale de Cultures de 
Microorganismes (CNCM) on February 27, 2004. 

8. SJL m deposited as CNCM 1-3175 in the Collection Nationale de Cultures de 
Microorganismes (CNCM) on February 27, 2004. 

9. SJL a deposited as CNCM 1-3176 in the Collection Nationale de Cultures de 
Microorganismes (CNCM) on February 27, 2004. 

In an embodiment of the present invention is a kit for amplifying DNA containing: 

- an isolated thermostable polypeptide, wherein said polypeptide has at least 80% 
homology to SEQ ID NO: 26, wherein said polypeptide has at least one mutation at a position 
selected from the group consisting of M470, F472, M484, and W550, more preferably 
selected from the group consisting of M470K, M470R, F472Y, M484V, M484T, and 
W550R, and wherein said polypeptide has DNA polymerase activity and 5 '-3' exonuclease 
activity; 

- a concentrated buffer solution, wherein when said concentrated buffer is admixed 
with the isolated polypeptide the overall buffer concentration is IX; 

- one or more divalent metal ions; and 

- deoxyribonucleotides. 



33 



Docket Number 248628US0X 

In this embodiment, the preferred divalent metal ion is Mg^* or Mn^*. In this 
connection, the concentration of the divalent metal ion ranges from 0.1 to 5 mM, preferably 
from 1 to 3 mM, more preferably from 2 to 2.5 mM. However, if the reaction is performed in 
a phosphate buffer, a buffer containing EDTA, or a buffer containing any other magnesium 
chelator, the concentration of magnesium may be increased to up to 100 mM. 

For the kit of the present invention the isolated thermostable polypeptide may be in a 
form selected from the group consisting of a lyophilized form, a solution form in a suitable 
buffer or carrier, and a frozen form in a suitable buffer or carrier. 

The kit of the present invention may also include a 5' to 3' exonuclease and/or a 3' to 
5' exonuclease. A preferred 5' to 3' exonuclease has a sequence as in SEQ ID NO: 50 (the 
DNA is in SEQ ID NO:60) and the 3' to 5' exonuclease as in SEQ ID NO: 51 (the DNA is in 
SEQ ID NO:61). 

With respect to the suitable buffer or carrier, the following components may be used: 
Tris-HCl, KCl, Triton-XlOO, dimethylsulfoxide, tetramethyl ammonium chloride, etc. 

In the present invention, the concentrated buffer solution corresponds to a stock 
solution that has a concentration ranging from 1.5X to lOX, where the concentration is 
measured in relation to the final reaction concentration (IX). To this end, the buffer solution 
(IX) contains the following components: 10 mM Tris-HCl, pH at 25°C of 9, 50 mM KCl, 
0.1%Triton-X100. 

For the kit according to the present invention, the stock concentration of the 
deoxyribonucleotides ranges from 50 ^M to 200 mM, preferably from 75 ^M to 150 mM, 
more preferably 100 ^M to 100 mM, for each dNTP. Moreover, the concentration of each 
dNTP in the PCR reaction according to the present invention should range from 10 ^iM to 
500 ^M, preferably from 25 ^iM to 400 ^iM. more preferably 50 yM to 300 \iM. As used in 
the present invention, the term "deoxyribonucleotides" includes: dATP, dCTP, dGTP, and 
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dTTP. It is to be understood that within the scope of the present invention, the kit may 

include in place of or in addition to the aforementioned components, RNA precursors, minor 

('*rare") bases, and/or labelled bases. 

In another embodiment of the present invention is a method of amplifying DNA from a 

culture and/or purified stock solution of DNA and/or mRNA by utilizing a thermostable 
polypeptide according to the present invention. To this end, protocols for conducting PCR 
and RT-PCR would be readily appreciated by the skilled artisan. However, for sake of 
completeness, the artisan is directed to the following exemplary references for protocols for 
conducting PCR and RT-PCR (See, for example, Rougeon, F, et al. (1975) Nucl. Acids Res., 
2, 2365-2378; Rougeon, F, et al. (1976) Proc. Natl. Acad. Sci. USA, 73, 3418-3422; Grabko, 
V. I., et al. (1996) FEBS Letters, 387, 189-192; and Perler, F., et al. (1996) Adv. Prot. Chem., 
48, 377-435) 

With reference to reverse transcribing an RNA, a preferred method includes: 

a) providing a reverse transcription reaction mixture comprising said RNA, a primer, 
a divalent cation, and an isolated thermostable polypeptide comprising an amino acid 
sequence having at least 80% homology to SEQ ID NO: 26, wherein said polypeptide has at 
least one mutation at a position selected from the group consisting of M470, F472, M484, and 
W550, more preferably selected from the group consisting of M470K, M470R, F472Y, 
M484V. M484T, and W550R, and wherein said polypeptide has DNA polymerase activity 
and 5 '-3 ' exonuclease activity in a suitable buffer; and 

b) treating said reaction mixture at a temperature and under conditions suitable for 
said isolated polypeptide to initiate synthesis of an extension product of said primer to 
provide a cDNA molecule complementary to said RNA. 

It is to be understood that the skilled artisan would appreciate that the thermal cycling 
should be optimized to account for variations in the enzyme selected, the template to be 
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reverse transcribed, the primers to be used to facilitate amplification (i.e., with respect to the 
melting and annealing temperatures), and the relative concentrations to be used for each of 
the reaction components. Such optimization is well within the purview of the skilled artisan; 
however, exemplary protocols may include the following: 
Table 2 : PGR protocols 





a 


D 


c 


d 


e 


#of 
repeated 
Gycles 


PGR 1 


94 °G. 3' 


94 °G, r 


66°G, 1' 


72°G, 2' 


72°G, 15' 


b-d = 30 


PGR 2 


94°G, 3' 


94 °G, 1 ' 


62°G, 1' 


72°G, 2' 


72°G, 15' 


b-d = 30 


PGR 3 


94°G, 3' 


94°G, 30" 


59°G, 30" 


72°G, r 


72°G, 15' 


b-d = 30 


PGR 4 


94 °G, 3' 


94 °G, 30" 


68°G, 1.5' 


68°G, 6' 




b-c = 35 


PGR 5 


94 °G, r 


94 °G, 30" 


70°G, 30" 


72°G, r 


72°G, 15' 


b-d = 25 


PGR 6 


94 °G, 3' 


94 °C, 30" 


59''G, 30" 


72°G, 1' 


72''G, 15' 


b-d = 35 


PGR 7 


94°G,3' 


94 °G,r 1 


58 °C, 1' 


72°G, 2' 


72°G, 15' 


b-d = 35 



Moreover, it is to be understood that contemplated in the present invention is that with 
the polypeptide of the present invention the skilled artisan would appreciate that the buffer 
components and buffer concentrations should also be optimized. To this end, in a preferred 
embodiment, the kit of the present invention may be utilized. 

As used above, the phrases "selected from the group consisting of," "chosen from," and 
the like include mixtures of the specified materials. 

In one embodiment of a method of obtaining a thermostable variant enzyme : 
provided. This method comprises the following: 

a) screening enzymes expressed at the surface of phage particles and 
identifying at least a thermostable variant conserving its active; 
catalytic domain at regulated temperature according to the method of identifying 
thermostable mutant polypeptides having a catalytic activity as described herein. 



is 
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b) isolating and sequencing a DNA encoding said identified 
thermostable variant; 

c) preparing a vector comprising the DNA of step (b); 

d) transfecting or infecting cells with the vector obtained at step c); 

e) expressing the thermostable variant enzyme from the 
cells and optionally, 

f) recovering, isolating and purifying said thermostable variant 
enzyme expressed at step (e). 

Where a numerical limit or range is stated herein, the endpoints are included. Also, all 
values and subranges within a numerical limit or range are specifically included as if 
explicitly written out. 

Having generally described this invention, a fiirther understanding can be obtained by 
reference to certain specific examples, which are provided herein for purposes of illustration 
only, and are not intended to be limiting unless otherwise specified. 

EXAMPLES 

Materials and methods 
Buffers- 
Buffer A 

50 mM Tris-HCl at pH 8.3 at 25X, 50 mM KCl, 10 mM MgCls, 0.5 mM 
spermidine, 10 mM dithiothreitol 

Buffer B(lx): 

20 mM Tris-HCl at pH 8.8 at 25°C, 10 mM KCl, 10 mM (NRa)2SO^, 2 mM 
MgS04, 0. 1 % Triton X- 1 00, 0. 1 g/1 BS A 
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Buffer C (Ix): 

10 mM Tris-HCl at pH 9.0 at 25°C, 50 mM KCl, 1.5 mM MgCh, 0.1% Triton 
X-100 

Synthesis of substrates for selection- 

Deoxyoligonucleotides were prepared by solid phase synthesis on a DNA synthesizer 
(Expedite™, Millipore). The 5'-maleimidyl derivatized primer TAA CAC GAC AAA GCG 
CAA GAT GTG GCG T (SEQ ID NO: 13) was synthesized as described previously (Jestin 
J.L., Kristensen P., Winter G., A method for the selection of catalysis using phage display 
and proximity coupling. Angew. Chem. Int. Ed. 1999, 38, 8, 1 124-1 127. ) purified on a C18 
reverse phase HPLC column, and characterized by electrospray mass spectroscopy 8998.4 / 
8999.9 (measured / calculated). 5-[-N-[N-(N-biotinyl-8-aminocaproyl)-y-aminobutyryl]-3- 
aminoanyl]-2'deoxy-uridine-5'-triphosphate (biotin-dUTP) was purchased from Sigma and 
the other deoxynucleotide triphosphates dATP, dCTP and dGTP were obtained fi-om Roche- 
Boehringer. 



Library construction- 
Three phagemids libraries were mixed for phage preparation. The first two libraries 
(I : Fsel/NotI and II : Pstl/Nhel) derive from mutagenic PGR ampUfication of the wild-type 
Taq gene in the presence of manganese [ I : reference(Fromant, Blanquet, Plateau, Anal. 
Biochem., 224, 347-353, 1995) with MnCb : 0.5 mM; II : reference (Cadwell, Joyce, PGR 
methods and amplifications. Mutagenic PGR, 3, S136-S140) with four distinct MnCb 
concentrations (0.5, 0.35, 0.25 and 0.125 mM)] using following primers (I) SEQ ID NO: 1 
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and SEQ ID NO: 2, PGR 1, or (II) SEQ ID NO: 3 and SEQ ID NO: 4, PGR 2 (for primers: 
see Table 1, and for cycle settings: see Table 2). 

The third phagemids library (III) was constructed by oligonucleotide assembly using 
the wild-type Taq gene. First, four PGR fragments were prepared using Taq polymerase 
(PGR 3, see Table2), the wild-type Stoffel fragment gene as template and the following 
primer pairs (5-6), (7-8), (9-10) and (11-2) in buffer C IX (for primers: see Table 1). 

After purification with the QIAquick PGR Purification kit (QIAGEN), the four PGR 

fragments were assembled in a second PGR round using the kit GC- Advantage obtained from 

Glontech under PGR 4 (see Table 2), using buffer D IX. The crude PGR product was then 

amplified by PGR using PGR 5 protocol, the GG-Advantage kit, and the primers 1 and 2 in 

buffer D IX. Subsequently, the product was purified using the QIAquick Gel extraction gel 

(QIAGEN). 

Buffer D IX 

40 mM Tricine-KOH (pH 9,2) 

ISmMKOAc 

3.5 mM Mg(OAc)2 

5 % DMSO 

3.75 ng/ml BSA 

0.005% Noninet P-40 

0.005% Tween-20 

After subcloning into pHENl vectors using restriction sites Fsel/NotI or Pstl/Nhel, 
1 . 1 X 1 0' distinct clones were obtained by electroporation in E. coli strain TG 1 . 
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Table 1 : Oligonuleotides and primers 



SEQ ID NO: 


Oligonucleotide sequences 


1 


TAACAATAGGCCGGCCACCCCTTC 


2 


GAGTTTTTGTTCTGCGGC 


3 


TTTAATCATCTGCAGTACCGGGAGCTC 


4 


1 TTCATTCTTGCTAGCTCCTGGGAGAGGC 


5 


CCG GCC ACC CCT TC(C AR/A VY)C TCA AC(C AR/A VY)CGG GAC CAG CTG GAA 

AG 


6 


1 GGA TGA GGT CCG GCA A( YT G/RB T) (YT G/RB T)AA T(YT G/RB T)GG TGC T CT 
1 TCA GCT T( YT G/RB T)GA GCT CCC GGT ACT GCA GG 


7 


1 CAA CCA GAC GGC CAC G(CA R/AV Y)AC GGG CAG GCT A(CA R/AV Y)AG CTC 
1 C(CA R/AV Y)CC CAA CCT CCA GAA CAT CC 


8 


CCG CCT CCC GCA C(YT G/RB T)CT TCA C( YT G/RB T)GG CCT CTA GGT CTG 

GCAC 


9 


CCT GCA GTA CCG GGA GCT C(CA R/AV Y)AA GCT GAA GAG CAC C 
(CA R/AV Y)AT T(CA R/AV Y)(CA R/AV Y)TT GCC GGA CCT CAT CC 


10 


GGA TGT TCT GGA GGT TGG G(YTG/RBT)GG AGC T(YTG/RBT)TA GCC TGC CCG 
T(YTG/RBT)CG TGG CCG TCT GGT TG 


11 


GTG CCA GAC CTA GAG GCC (CAR/AVY) GTG AAG (CAR/A VY) GTG CGG G AG 


12 


AAA UAC AAC AAU AAA ACG CCA CAU CUU GCG 


13 


TAA CAC GAC AAA GCG CAA GAT GTG GCG T 


14 


AAA TAC AAC AAT AAA ACG CCA CAT CTT GCG 


15 


TTCATTCTTGCTAGCTCCTGGGAGAGGC 


16 


GAG AAG ATC CTG CAG TAC CGG GAG C 


17 


GACCAAC ATCAAGACTGCC 


18 


TTGGCCAGGAACTTGTCC 
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Table 2 : PCR cycles 





a 


D 


c 


d 


e 


#of 
repeated 
Cycles 


PGR 1 






DO 1 


11 C, 2 


IT'C, 15 


b-d = 30 








OZ 1 


72 C, 2 


15 


b-d = 30 


PCR 3 


94°C, 3' 


94°C, 30" 


59<'C, 30" 


72°C, 1 ' 


u 




PCR 4 


94 °C, 3' 


94 °C, 30" 


68°C, 1.5' 


68°C, 6' 




b-c = 35 


PCR 5 


94 °c, r 


94 °C, 30" 


70°C, 30" 


72°C, r 


72°C, 15' 


b-d = 25 


PCR 6 


94°C,3' 


94 "C, 30" 


59°C, 30" 


72°c, r 


72°C, 15' 


b-d = 35 


PCR 7 


94 °C,3' 


94 °C, V 


58 "C, V 


72°C, 2' 


72°C, 15' 


b-d = 35 



Phage preparation and selection- 

For phage preparation, E. coli TGI transformed by the phagemid library and grown to 
an optical density of 0.3 at 600 nm were infected by a twenty-fold excess of helper phage. 
Phage particles were produced at 30°C for 19 hours in a 2xTY medium containing 100 mg/1 
ampiciUin, 25 mg/1 kanamycin. After removal of bacteria by two centrifiigation (4000 ipm, 
4°C), phage particles in the supernatant were purified by two precipitations in 4% 
polyethyleneglycol in 0.5 M NaCl, resuspended in 1 ml of PBS (pH 7.4), and dialyzed four 
times against PBS over a period of 24 hours. The pH of the final solution was raised to pH 8. 

The protocol for selection was as described previously (Jestin J.L., Kristensen P., 
Winter G. A method for the selection of catalysis using phage display and proximity 
coupUng. Angew. Chem. Int. Ed. 1999, 38, 8, 1 124-1 127; Vichier-Guerre S., Jestin J.L. 
Iterative cycles of in vitro protein selection for DNA polymerase activity, Biocat. & 
Biotransf. 2003, 21, 75-78), except that 10'° infectious phages particles were used after 
heating at 65°C for 5 minutes and that DNA polymerization was done at 65°C. 

Substrate cross-Hnking on phage was done by incubating the phage particles with 10 
MM maleimidyl-derivatized primer, 50 ^iM RNA template of SEQ ID NO: 12 in the presence 
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of 10 mM magnesium chloride at 37°C for 2 hours and polymerization during 2 minutes at 

65°C after addition of 3 \xM. biotin-dUTP and 1 joM dVTP. 

The reactions were blocked by addition of one volume of 0.25 M ethylene diamine 

tetra-acetate. The phage mixture was added to 200 ^1 of streptavidin-coated 

superparamagnetic beads (Dynabeads M-280, Dynal). After 30 minutes at room temperature, 

the beads were washed seven times and resuspended in 200 |j,l PBS. 

The phage-bead mixture was incubated for 10 min at 37 "C after addition of one- 
tenth, in volume, of trypsin (0.1 g/I). 1.8 mL of E.coli TGI was then added for infection 
during 25 min at 37 °C. Bacteria were plated on 530 cm^ Petri dishes (Coming). After 12 
hours at 30°C, bacteria were scraped from the plate with a 2xTY medium containing 
ampicillin and about 2x10^ cells were used for preparation of the phage particles. 

RT-polymerization and polvmeriz ation activity assay using phage-p olvmerasp 

In the following examples, the activity of the different mutant phage-polymerases was 
assayed by incorporation of radiolabeled dTTP. 



Example 1 - polyclonal phage-polymerases (Figure 1): 

In this example, the reverse transcriptase activity of phage-polymerases was assessed 
as obtained after different rounds of selection in the presence of Mg^^ or Mn^* ions. In these 
experiments, two reverse transcription (RT) mixes were used. The final concentration of each 
component in a reaction was: lO^M RNA (SEQ ID NO: 12); 5nM DNA (SEQ ID NO: 13); 
0.25 mM dNTP; 3 mM MgCb or 2.5 mM MnCla. 

Each 1.9 ^l aliquot of the reaction mix was fiirther added to 15 fil of phage- 
polymerases (10^ particles) after a given selection round heated for 5 min at 65°C. The 
solutions were then incubated at 37°C for 15 min. The reactions were stopped by adding 15 
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^il of EDTA/formamide containing denaturation solution, heating for 3 min. at 94°C, and 
placed on ice. The incorporation of alpha ^^P-dTTP was determined on 20% polyacrylamide 
gel; 15 nl of the final reaction volume were loaded. 



The lane designations in Figure 1 are as follows: 
MnCh 



a : phage-polymerases of round 6 

b : phage-polymerases of round 5 

c : phage-polymerases of round 4 

d : phage-polymerases of round 3 

e : phage-polymerases of round 2 

f : phage-polymerases of round I 

g : phage-polymerases of initial population 



MeCh 



h: phage-polymerases of round 6 

i : phage-polymerases of round 5 

j : phage-polymerases of round 4 

k : phage-polymerases of round 3 

1 : phage-polymerases of round 2 

m : phage-polymerases of round 1 

n : phage-polymerases of initial population 



This experiment demonstrated that: 

- A RT-activity is present using phage-polymerase obtained after round 5 (i) or 6 (h) of 
selection in presence of Mg^"*^. 

- A high RT-activity was detected at the round 3 (d) in the presence of Mn^^ and for further 
rounds. 



Example 2 - polyclonal phage-polymerases (Figure 2): 

In this example, the reverse transcriptase activity of phage-polymerases was assessed 
as obtained after different rounds of selection in the presence of Mg^^ ions. Jn these 
experiments, a reverse transcription (RT) mix was used. The final concentration of each 
component in a reaction was: lO^iM RNA (SEQ ID NO: 12); S^iM DNA (SEQ ID NO: 13); 
0.25mM dNTP; 3mM MgC^. 

Each 1.2 III aliquot of the reaction mix was fiirther mixed with 15 ^l of phage- 
polymerase polymerases (10^ particles) after one round of selection round, either not 
preheated or heated 5 min at 65X before reaction of polymerization. The solutions were then 
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incubated at 3TC for 15 min. The reactions were stopped by adding 15 \i\ of the denaturation 
solution, heating for 3 min. at 94''C and placing on ice. 

The incorporation of alpha ^^P-dTTP was determined on 20% polyacrylamide gel; 
15 ^il of the final reaction volume were loaded. The positive control was performed with 
addition of different concentration of commercial AMV reverse transcriptase (Promega). 

The lane designations in Figure 2 are as follows: 



Phage-polvmerase heated at 65 for 5 min. 

a : phage-polymerases of initial population 

b : phage-poljonerases of round 1 

c : phage-polymerases of round 2 

d : phage-polymerases of round 3 

e : phage-polymerases of round 4 

f : phage-polymerases of round 5 

g : phage-polymerases of round 6 



Phage-polvmerase not preheated 



h : phage-polymerases of initial population 

i : phage-polymerases of round 1 

j : phage-polymerases of round 2 

k : phage-polymerases of round 3 

1 : phage-polymerases of roimd 4 

m : phage-polymerases of round 5 

n : phage-polymerases of round 6 

o : control AMV-RT, 1 U 

p : control AMV-RT, 0.1 U 

q : control AMV-RT, 0.01 U 

r : control AMV-RT, 0.001 U 



This experiment demonstrated that: 

- A RT-activity is present using phage-polymerase obtained after round 5 or 6 of selection 
preheated for 5 min. at 65X (f and g) or not (m and n) as in Figure 1 in presence of Mg^^ 

- A high RT-activity was detected using 1 unit of AMV-RT (o) but no activity was detected 
using decreasing concentration of AMV-RT. 



Example 3 - monoclonal phage-polymerases (Figure 3): 

In this example, the reverse transcriptase activity of various monoclonal phage- 
polymerases obtained after round 6 in the presence of Mg^*' ions was assessed. In these 
experiments, a reverse transcription (RT) mix was prepared in which the final concentration 
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of each component in a reaction was: \0\iM RNA (SEQ ID NO: 12); SjiM DNA (SEQ E) 
NO: 13); 0.25mM dNTP; 3 mM MgCh- 

Each 1.45 \i\ aliquot of the reaction mix was further mixed with 15 ^l of phage- 
polymerase heated for 5 min at 65°C. The solutions were then incubated at 37°C for 20 min. 
The reactions were stopped by adding 15 ^l of denaturation solution, heating for 3 min. at 
94°C, and placed on ice. 

The incorporation of alpha ^^P-dTTP was determined on a 20% polyacrylamide gel; 
15 ^l of the final reaction volume were loaded. The positive control was performed using the 
AMV-RT (Promega), lane C. 

The different monoclonal phage-polymerases were obtained among the phage- 
polymerases of round 6. The phage-polymerases present various DNA-polymerase RNA- 
dependant activities. The lane designations in Figure 3 are as follows: s = SEQ ID NO: 38; a 
= SEQ ID NO: 20; d = SEQ ID NO: 24; g = SEQ ID NO: 28; C = AMV-RT; i = SEQ ID NO: 
30; m = SEQ ID NO: 32; n = SEQ ID NO: 34; b = SEQ ID NO: 22; and q = SEQ ID NO: 36. 

The clones a, b, and d possess a high RT-activity, which were further studied as 
reported in Figure 4. Randomly chosen clones fi-om the selected populations were assayed 
for monoclonal phage-polymerase reverse transcriptase activity and that further sequencing 
of the corresponding mutant genes revealed identical sequences (for example, 7 clones 
reported on the figure were found to have the same sequence noted a). 
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Example 4 - monoclonal phage-polymerases (Figure 4): 

In this example, the reverse transcriptase and the polymerase activities of monoclonal 
phage-polymerases obtained after the round 6 in the presence of Mg^^or Mn^* ions was 
assessed. In these experiments, the final concentration of each component in a reaction was: 
lO^iM RNA (SEQ ID NO: 12); 5\iM DNA (SEQ ID NO: 13); 0.25mM dNTP; 3 mM 
MgCl2 or 2.5 mM MnCh; and 

\m DNA (SEQ ID NO: 14); 1^M DNA (SEQ ID NO: 13); 0.25mM dNTP; 3 mM 
MgCl2 or 2.5 mM MnCl2 

2 \i\ aliquots of the reaction mix were further added to 15 ^1 of each phage- 
polymerase pre-heated for 5 min at 65°C. The solutions were then incubated at 37°C for 15 
min. The reactions were stopped by adding 15 ^1 of denaturation solution, heating 3 min. at 
94°C, and placed on ice. 

The incorporation of alpha "P-dTTP was determined on polyacrylamide gel; 15 ^1 of 
the final reaction volume were loaded. The positive control was performed using the phage 
Stoflfel fi-agment (e). 

The lane designations in Figure 4 are as follows: a = SEQ ID NO: 20; b = SEQ ID 
NO: 22; d = SEQ ID NO: 24; and e = SEQ ID NO: 26. 

Three families of phage polymerase were characterized among the phage-polymerases 
of round 6. 

- The phage-polymerases a and b present a high DNA-polymerase DNA-dependent 
activity, which is higher than that of Stofifel phage-polymerase. 

- The phage-polymerases b and d present a high DNA-polymerase RNA-dependent 
activity, which is higher than that of the Stoffel phage-polymerase e (not detectable, see 
figure) or than the phage-polymerase a, whatever the conditions in the presence of 
magnesium or of manganese. 
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- The phage-polymerase d shows a poor DNA-polymerase DNA-dependent activity, which 
is lower than the activity of the Stoffel phage-polymerase. 

Construction and overproducing clones- 
Three phagemids corresponding to clones a, b and d on Figure 4 were isolated from 
individual colonies of E. colt strain TGI . The plasmid DNA was prepared and purified using 
Wizard Plus miniprep kits. The phagemids were cleaved with Ncol and NotI restriction 
endonucleases. The fragments were dephosphorylated with alkaline phosphatase, purified on 
QIAgen QIAquick and ligated into expression vector pET-28b(+) (Novagen) that had been 
cleaved with Ncol and NotI and containing a sequence for the thrombin cleavage site 
between the NotI and Xhol restriction sites (GCGGCCGCACTGGTGCCGCGCGGCAGC 
CTCGAG; SEQ ID NO: 45). 

Recombinant plasmids were transformed in E. coli strain BL21 pLysS and plated on 
2 YT media with kanamycin and chloramphenicol. Correct plasmid constructions were 
initially identified by restriction analysis of plasmid miniprep. 

E. coli strain BL21, used as a host for recombinant plasmids to over produce the 
mutant RT-polymerase. was grown in 2YT medium supplemented with 10 ^g/ml kanamycin 
and 25 ^g/ml chloramphenicol to propagate plasmids and 1 mM of isopropyl-l-thio-P-D- 
galactopyranoside (IPTG) to induce production of enzyme. 

Purification of mutant RT-polymerases- 

Mutants were prepared from 500 ml batches of cells. 2YT media plus kanamycin and 
chloramphenicol was inoculated with bacteria (containing a recombinant plasmid) freshly 
picked on a plate and grown at 37 °C to an absorbance at 600nm of approximately 0.5. 
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Subsequently, IPTG was added to a final concentration of 1 mM and the cultures were 
allowed to further grow for 5 h. 

Cells were harvested by centrifugation at 15000 g and 4''C for 10 min., resuspended in 
30 ml of lysis buffer (50 mM Na2HP04, 300 mM NaCl, 5 mM imidazole, pH= 8), lysed 3 
times for 45 sec by ultrasoxmd. Cell debris were removed by centrifugation at 10000 g and 
4°Cfor 15min. 

Mutant RT polymerases were recovered fi-om this clarified lysate and purified using 
Ni-NTA agarose (QIAGEN). 



Example 5 - Purified mutant RT-polymerases a, b, and d used in polymerase chain reaction 
(Figure 5): 



After purification on Ni-NTA agarose, the mutant polymerases were dialyzed in 
buffer Tris 100 mM, pH = 8 and stored at 4°C. 

PGR mix 


Component 




Amoimt 


Buffers lOX (*) 




20 nl 


MgCl2 25 mM 




10^1 


primer 15 (50 ^M) 




4^1 


primer 16 (50 ^M) 




4 Hi 


dNTP 25 mM 




2jil 


Water 




157.5 Hi 


Template (Stoffel fragment gene) 




2 Hi 


Pfu polymerase (3U/^l) 




0.5 \i\ 



(*) See Buffer B composition above 

The PCR was performed using 1 9^1 of PGR mix and 0,6^1 of mutant-polymerase, 
b and d. 
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The lanes in the gel appearing in Figure 5 include the three clones corresponding to 
clones a, b and d on Figure 4. In addition, the positive control was performed using the 
Stoffel fragment polymerase e and commercial Taq DNA polymerase (Promega). The lanes 
in Figure 5 are as follows: 

lane 1 : Taq 

lane 2 : a = SEQ ID NO: 20 
lane 3 : b = SEQ ID NO: 22 
lane 4 : d = SEQ ID NO: 24 
lane 5: e = SEQ ID NO: 26 
lane 6 : Molecular weight marker 

Example 6 - Purified mutant RT-polymerases a, b, and d used in RT-polymerase chain 
reaction (Figure 6): 

The positive control was performed using the phage-polymerase of AMV-RT 
(Promega). 

These studies were performed using the three clones corresponding on clones a, b and 
d in figure 4. 

The reverse transcription was performed at 65°C during 1 h using the following 
conditions. 
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Control RT mix 



Component 




RNA from rabbit ^lohin (^\Qmi\\ 90 iia/m1 


1 111 


pnmer 17 (5 fiM) 


0.4 ^l 


primer 18 (5 ^M) 


0.4 nl 


buffer A (**)AMV-RT 5X 


3fil 


dNTP 2.5 mM 


0.8 Hi 


AMV-RT lOU/fil 


3^1 


water 


6.4 ^l 


(**) See buffer A composition above 





RT 


mix 


Component 




RNA from rabbit globin (sigma), 20 ^ig/ml 


1^1 


primer 17 (5 ^M) 


0.4 ^l 


primer 18 (5 ^M) 


0.4 


MgCl2 25 mM 


0.75 


buffer C(***) 


1.5 ^l 


dNTP 2.5 mM 


0.8 ^1 


mutant polymerase a, b, d 


3 ^1 


or the Stoflfel fragment e 




water 


7.15 


(***) See buffer C composition above 





50 



Docket Number 24i628US0X 



The PGR was performed using PGR 7 (see table 2) and following conditions. 

PGR mix 


^^omponeni 


Amount 


Buffer B lOx 




primer 17 (50 ^M) 


4^1 


primer 18 (50 ^M) 


4^l 


dNTP 


2^1 


water 


164.5 ul 


Tag DNA polymerase (5U/|xl) 


5 ^l 


Pfu polymerase (3U/jxl) 


0.5 ^il 



19 ^1 aliquot of the PGR mix was added to 1 ^1 of the RT reaction product. 

A RT-PGR product of 372 bp was detectable using mutant RT-polymerases b and d. 

The lanes in the gel appearing in Figure 6 include the three clones corresponding to 

clones a, b and d on Figure 4. In addition, the positive control was performed using the 

Stoffel fragment polymerase e and the commercial AMV-RT (Promega). 

The lanes in Figure 6 are as follows: 

lane 1 : : molecular weight marker 

lane 2 : control AMV-RT 

lane 3 : b = SEQ ID NO: 22 

lane 4 : a = SEQ ID NO: 20 

lane 5: e = SEQ ID NO: 26 

lane 6 : d = SEQ ID NO: 24 

Summary of the Taq Sequence Variants above- 
In the N-terminus of the purified proteins, the signal sequence is not taken in account, 
the peptide having the sequence MASG4GG4 (SEQ ID NO: 39) has been introduced upstream 
die sequence SPKA (amino acids 13-16 of SEQ ID NO: 26). which correspond to the Stoffel 
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fragment beginning (S being the amino acid occupying the position number 290 in the Taq 
polymerase sequence). 

In the C-terminus of the purified proteins, the sequence AAALVPRGSLEHa (SEQ ID 
NO: 40) comprising a site of cleavage by thrombin, as well as a polyhistidine tag has been 
introduced to facilitate further piuification of the protein. 



Mutations assessment 


sequence 


SEQ ID NO: 


M761V 


SEQ ID No. "s" 


38 


M761T, D547G, I584V 


SEQ ID No. "a" 


20 


W827R 


SEQ ID No. "m" 


32 


W827R, E520G, A608T 


SEQ ID No. "b" 


22 


W827R, A517V, T664S, F769S 


SEQ ID No, "g" 


28 


M747K, Q698L, P816L 


SEQ ID No. "n" 


34 


M747R, W604R, S612N, V730L, R736Q, S739N, N483Q, S486Q. 
T539N, Y545Q, D547T, P548Q, A570Q, D578Q, A597T 


SEQ ID No. "d" 


24 


F749Y, A568V 


SEQ ID No. "i" 


30 


F749Y, P550Q, R556S, V740E, V819A 


SEQ ID No, "q" 


36 



Numerous modifications and variations on the present invention are possible in light 
of the above teachings. It is, therefore, to be understood that within the scope of the 
accompanying claims, the invention may be practiced otherwise than as specifically 
described herein. 
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